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Abstract — Optimal and suboptimal decentralized estimators 
in wireless sensor networks (WSNs) over orthogonal multiple- 
access fading channels are studied in this paper. Considering 
multiple-bit quantization before digital transmission, we develop 
maximum likelihood estimators (MLEs) with both known and 
unknown channel state information (CSI). When training sym- 
bols are available, we derive a MLE that is a special case of 
the MLE with unknown CSI. It implicitly uses the training 
symbols to estimate the channel coefficients and exploits the 
estimated CSI in an optimal way. To reduce the computational 
complexity, we propose suboptimal estimators. These estimators 
exploit both signal and data level redundant information to 
improve the estimation performance. The proposed MLEs reduce 
to traditional fusion based or diversity based estimators when 
communications or observations are perfect. By introducing 
a general message function, the proposed estimators can be 
applied when various analog or digital transmission schemes 
are used. The simulations show that the estimators using digital 
communications with multiple-bit quantization outperform the 
estimator using analog-and-forwarding transmission in fading 
channels. When considering the total bandwidth and energy 
constraints, the MLE using multiple-bit quantization is superior 
to that using binary quantization at medium and high observation 
signal-to-noise ratio levels. 



I. Introduction 

Wireless sensor networks (WSNs) consist of a number 
of sensors deployed in a field to collect information, e.g., 
measuring physical parameters such as temperature and hu- 
midity. Since the sensors are usually powered by batteries and 
have very limited processing and communication abilities Ul, 
the parameters are often estimated in a decentralized way. 
In typical WSNs for decentralized estimation, there exists a 
fusion center (FC). The sensors transmit their locally processed 
observations to the FC without inter-sensor communications, 
and the FC generates the final estimation based on the received 
signals [2|. 

Both observation noise and communication error deteriorate 
the performance of the decentralized estimation. Traditional 
fusion based estimators are able to minimize the mean square 
error (MSE) of the parameter estimation by assuming perfect 
communication links (see |J3) and references therein). They 
reduce the observation noises by exploiting the redundant 
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observations provided by multiple sensors. However, their 
performance will degrade dramatically when communication 
errors cannot be ignored or corrected. On the other hand, var- 
ious wireless communication technologies aimed at achieving 
transmission capacity or improving reliability cannot neces- 
sarily minimize the MSE of the parameter estimation. For 
example, although diversity combining reduces the bit error 
rate (BER) of communications, it requires that the signals 
transmitted from the sensors are identical, which is not true 
in the context of WSNs due to the observation noises ap- 
peared in sensors. This motivates the joint optimization of 
the communication-oriented diversity combination and data 
fusion-oriented estimator at the FC under realistic observa- 
tion and channel models, which uses MSE of the parameter 
estimation as the performance metric. 

The bandwidth and energy constraints are two most impor- 
tant issues that are addressed in WSNs. When strict bandwidth 
constraint is taken into account, the decentralized estimation 
when the sensors only transmit one bit (binary quantization) 
for each observation is studied in |4|-|9|. Among them, |4j, 
[|5l introduce the maximum likelihood estimation (MLE) and 
discuss the optimal quantization when the communication 
channels are noiseless. Also considering noiseless channels, 
[6| proposes a universal and isotropic quantization rule, and 
ID, (]9l study the adaptive binary quantization methods. When 
channels are noisy, fT^ studies the MLE in additive white 
Gaussian noise (AWGN) channels and introduces several low 
complexity suboptimal estimators. It has been found that the 
binary quantization is sufficient for decentralized estimation 
at low observation signal-to-noise ratio (SNR), whereas the 
sensors need to transmit a few extra bits when the observation 
SNR is high [4|. When the energy constraint and general multi- 
level quantizers are considered, the decentralized estimation 
is studies under various channels. When communications are 
error-free, quantization at the sensors are designed in [10|- 
[14]. The optimal trade-off between the number of active sen- 
sors and the quantization bit-rate of each sensor is investigated 
under total energy constraint in |15|. In binary symmetrical 
channels (BSCs), the power scheduling is proposed to reduce 
the estimation MSE when the best linear unbiased estimator 
(BLUE) and a quasi-BLUE, where quantization noise is taken 
into account, are used at the FC |16|, [tTT). To the best of 
the authors' knowledge, the optimal decentralized estimator 
using multiple-bit quantization in fading channels is still not 
available. Although the MLE proposed in AWGN channel 
Q can be applied for fading channels if the channel state 



information (CSI) is known at the FC, it only considers binary 
quantization. 

Besides the decentraUzed estimation based on digital com- 
munications, the estimation based on analog communications 
receives considerable attentions due to the important con- 
clusions drawn from the studies in multi-terminal coding 
problem IfTSl . ||T9l . The most popular scheme is amplify-and- 
forward (AF) transmission, which is proved to be optimal in 
quadratic Gaussian sensor networks with AWGN multiple- 
access channels (MACs) 1201 . The power scheduling and 
energy efficiency of the AF transmission are studied under 
AWGN channels in [21 J and [22J. It shows that the AF 
transmission is more energy-efficient than that of digital com- 
munications with certain coding and modulation schemes. In 
fading channels, the AF transmission is not optimal any more 
both in orthogonal MACs Il23l - ll25l and in non-orthogonal 
MACs |26|. The outage laws of the estimation diversity with 
AF transmission in fading channels are studied in |24) and [23 
in different asymptotic regimes. These studies, especially the 
results in f23l, indicate that the separate source-channel coding 
scheme outperforms the AF transmission, which is a simple 
joint source-channel coding scheme, and is actually optimal 
in fading channels with orthogonal multiple-access protocols. 

In this paper, we develop the optimal and suboptimal decen- 
tralized estimators for a deterministic parameter considering 
digital communication systems. The observations of sensors 
are quantized, coded and modulated, then transmitted to the FC 
with orthogonal MACs over Rayleigh fading channels. Uni- 
form quantization is used since it is optimal for deterministic 
parameters. Because the binary quantization is only applicable 
for low observation SNR levels ID, ifTSJI . a general multi-bit 
quantizer is considered. 

We strive for deriving the MLE and feasible suboptimal 
estimator when different local processing and communication 
strategies are used. To this end, we first present a general 
message function to represent various quantization and trans- 
mission schemes. We then derive the MLE for an unknown 
parameter with known CSI at the FC. In typical WSNs, the 
sensors usually cannot transmit too many training symbols 
for the receiver to estimate or track channel coefficients due 
to both energy and bandwidth constraints. Therefore, we will 
also consider the case that the CSI is unknown at the FC when 
no or only a few training symbols are available, which is of 
practical significance. In order to reduce the computational 
complexity, we will introduce suboptimal estimators following 
the hint provided by the structure of the MLEs. 

Our contributions are summarized as follows. We develop 
the decentralized MLEs with known and unknown CSI at the 
FC over orthogonal MACs with Rayleigh fading. The perfor- 
mance of the MLEs serves as the practical performance lower 
bounds of the decentralized estimation in orthogonal MACs. 
To provide feasible estimators with affordable complexity, 
we propose a suboptimal algorithm, which can be viewed 
as modified expectation-maximization (EM) algorithm lIZTJI . 
By studying the special cases for error-free communications 
or noiseless observations, we show that the MLEs degenerate 
into the well-known centralized fusion estimator — BLUE, or 
the maximal ratio combiner (MRC) based estimator when 



CSI is known and a subspace based estimator when CSI is 
unknown. This indicates that our estimators can exploit both 
data level redundancy and signal level redundancy provided 
by the multiple sensors when both observation noises and 
communication errors are present. By introducing a general 
message function that can describe various quantization and 
transmission schemes, the proposed decentralized estimators 
can also be applied for the WSNs where AF transmission 
or digital transmission with binary quantization are used. 
Therefore, our estimators can bridge the gap between the 
estimators using two extreme case quantization. 

The rest of the paper is organized as follows. Section 
mi describes the system models we considered. Section [III] 
presents the MLEs with known and unknown CSI, and Section 
IrVl introduces the suboptimal estimators. In Section [V] we 
analyze several special cases of the MLEs. In Section [Vll 
we discuss the codebook design issue, the computational 
complexity, and the asymptotic performance of the presented 
MLEs. Simulations are provided in Section IVIII and the 
conclusions are given in Section IVIIII 

II. System Model 

We consider a typical kind of WSNs that consists of N 
sensors and a FC to measure an unknown deterministic param- 
eter 9, where there are no inter-sensor communications among 
the sensors. The sensors transmit their quantized observations 
to the FC over Rayleigh fading channels. Assume that the 
sensors use ideal orthogonal multiple-access protocols, such 
as TDMA and FDMA, to transmit their signals to the FC. 
Then the FC can separate the received signals from different 
sensors without inducing interference. 

Figure[T]is a diagram of the decentralized estimation system 
considered. The sensors process their observations for the 
parameter 6 before transmission. For digital communications, 
the processing includes quantization, channel coding and 
modulation, etc. For analog communications, the processing 
may be simply amplification before transmission. A function 
c(x), named as messaging function, is used to describe the 
local processing for both digital and analog communication 
systems. The transmission signals of the sensors arrive at the 
FC through independent Rayleigh fading channels, and the FC 
uses the received signals to estimate 6. 
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Fig. 1 . The diagram of the decentralized estimation system we considered. 

In the subsequent sections, we will first derive the decen- 
tralized estimators considering digital communication, then 
extend the results to the case using analog communication. 



A. Observation Model 

The observation for the unknown parameter 6 provided by 
the i-th sensor is, 



Xi = 6 + ns^t, i = 1,- ■ ■ ,N, 



(1) 



where n^ ^ is the independent identically distributed (i.i.d.) 
Gaussian observation noise with zero mean and variance a'^, 
and 9 is bounded within a dynamic range [— V, +V]. 



B. Quantization, Coding and Modulation 

We use the messaging function c(2:)|M — > C'^, which maps 
the observations to transmission symbols, to represent all 
the processing at the sensors including quantization, coding 
and modulation. To facilitate the analysis, the energy of the 
transmission symbols is normalized to 1, 



c{x)^c{x) = 1, Vx e 



(2) 



We consider the uniform quantization, which is optimal 
for deterministic parameters or for parameters with unknown 
statistics. For an M-level uniform quantizer, define the granu- 
lar region of the quantizer as [— W, +Vl^], then all the possible 
quantized values of the observations can be written as. 



S'„ = mA-M^, m = 0,---,M-l, 



(3) 



where A — 2W/{M — 1) is the quantization interval. 

The observations are rounded to the nearest Sm, therefore 
c{x) is a piecewise constant function described as. 



Co, 
c{x) = ( Cm, 



— oo < X < So + ^ 

'Jm "2" *^ "^ — ^m I "2" 5 



(4) 



cm-1, Sm-1 - ^ < X < +CX) 



where Cm = [cm.i,--- ,Cm,L]'^ is the L symbols corre- 
sponding to the quantized observation Sm to be transmitted, 
m = 0, • • • , A/ - 1. 

Under the assumption that W is much larger than the 
dynamic range of 6, the probability that \xi\ > W can be 
ignored. Then c(x) is simplified as, 

c(.t)=c„i, Sm- -;r < X < Sm + -Z-- (5) 

Define the transmission codebook as, 

Q = [co,---,CM-i]eC^^*^ (6) 

which can be used to describe any coding and modulation 
scheme following the Af -level quantization. 

The sensors can use various codes such as natural binary 
codes to represent the quantized observations. Since the focus 
of this paper is to design the decentralized estimators, we 
will not optimize the transmission codebook for the parameter 
estimation. 



C. Received Signals 

Since we consider orthogonal MACs, we assume that the 
FC can perfectly separate and synchronize to the received 
signals from different sensors. Assume that the channels are 
block fading, i.e., the channel coefficients are invariant during 
the period that sensors transmit L symbols representing one 
observation. After matched filtering and symbol-rate sampling, 
the L received samples corresponding to the L transmitted 
symbols from the i-th sensor can be expressed as. 



y* = \/£dhic{xi) + nc,i, i = 1, • • • ,N, 



(7) 



where y^ — [yi^i,-'' jJ/i.l]"^^ ^i is the channel coefficient 
subjecting to complex Gaussian distribution with zero mean 
and unit variance, rici is a vector of thermal noise at the 
receiver subjecting to complex Gaussian distribution with zero 
mean and covariance matrix cr^I, and £d is the transmission 
energy for each observation. 

III. Optimal Estimators with or without CSI 

In this section, we derive the MLEs when CSI is known 
or unknown at the receiver of the FC, respectively. The MLE 
using training symbols in the transmission codebook is also 
studied as a special form of the MLE with unknown CSI. 

A. MLE with Known CSI 

Given 9, the received signals from different sensors are 
statistically independent. If the CSI is known at the receiver 
of the FC, the log-likelihood function is, 

N 



\ogp{Y\h,9)^J2^ogpiy,\K,9) 

1=1 

= J2^ogU piy,\h,,x)pix\9)dx] (S) 

where h = [hi,- ■ ■ , Hn]"^ is the channel coefficients vector, 
and p{x\9) is the conditional probability density function 
(PDF) of the observation given 9. Following the observation 
model shown in ^, we have. 



p(x|i 



1 



27rcr< 



■ exp 



{x- 



2<7i 



(9) 



According to the received signal model shown in (|7]), the 
PDF of the received signals given CSI and the observation of 
the sensors is. 



p{yt\ht,x) 



1 



(^^D 



2\L 



exp 



|yi - V£dhic{x)\\l 



, (10) 



where ||z||2 = (z^z)^/^ is I2 norm of vector z. 

Substitute © and ([Toll to (HJ, then the log-likelihood 
function becomes. 



logp(Y|h, 



^ log f / exp 

\\yi ~ \f£dhic{x)\^\ 



{x-9f 



(11) 



where a = log 



"Sct^ct? 



affect the estimation. From now on, we will omit the constant 
when we write likelihood functions for simplicity. 

Now we consider the form of the likelihood function for 
digital communications, where c(x) is a piecewise constant 
function described in (|5]). Substituting (|5]i to (fTTT i. we have. 



is a constant that does not Following the Matrix Inverse Lemma, we have. 



Ry^ — I 



£d 



-c{x)c{x)^. 



Substituting ( fT9l ) and ( |20| ) to ( fTTI i. p(yi|a;) becomes, 

|y,||2 £rfyHc(a;)c(x)Hy, 



(20) 



p(yj|a;) =aexp 



N 



logp(Y|h,0)=^log 



p(yi|^i,c„0p(S'„i|6') 



: aexp 



a2 a2(^^ + ^2 

ly.lli £,|yHc(x) 



(12) 

where p{yi\hi,Cm) is the PDF of the received signals given 
CSI and the transmitted symbols of the sensors, which is. 



al <rl{£a + crl) 



2\ ; ' 



(21) 



where a 



TT^Cfd + O-^W 



jTTTy IS a constant. 



p(yi\hi, Cr, 



1 



{^'^D 



2\L 



exp 



f£dhiC 



m\\2 



(13) 



and p{Sm\0) is the probability mass function (PMF) of the 
quantized observation given 6, which is. 



On 



A 
2 



PiSm\e) = Q\ 

\ <-'s 

where Q(a;) = -== /Jj" exp 



The MLE that maximizes (I12ll is then 




(14) 



Upon substituting i2l\ and (O to (fTST l. the log-likelihood 
function becomes, 

logp(Y|0) = f] log (^I^J exp (-^^^- 

lly.lli ^ f,|yHc(a:)p^_^^^ ^22^ 



(23) 



a2 a2(£, + a2) 

Then, the MLE is obtained as, 

6 — argmaxlogp(Y|0). 



log rL exp ( 




A 
2 



0„ 



A 
2 




(15) 



The log-likelihood function in ( fTSl l is non-concave and has 
multiple extrema. It is difficult to find a closed-form expression 
of 6 or to compute 9 using high efficient numerical methods. 

B. MLE with Unknown CSI 

When the CSI is unknown at the FC, the log-likelihood 
function is, 

logp(Y|0) = ^ ("log ("y p{y,\x)p{x\e)dx)Y (16) 

which has a similar form to the likelihood function with known 
CSI shown in ^. 

According to the received signal model shown in (ITJ, given 
X, Yi subjects to zero mean complex Gaussian distribution, 
i.e., 

P{yt\^) = _r. j.^T> exp (-yf R- Vj) , (17) 



C. MLE with Unknown CSI using Training Symbols 

In typical communication systems, the transmitted symbols 
may consist of training symbols to facilitate channel estima- 
tion. In this subsection, we will analyze the MLE for such 
transmission schemes. 

Define Cp as a vector which consists of Lp training symbols. 
Each transmission for an observation will begin with transmit- 
ting Cp, then transmitting the data symbols defined as Cd{x). 
Thus the messaging function becomes. 



(24) 



c(x) = I 7 . 



Upon substituting this expression to (1221) and ignoring all 
the terms that does not affect the estimation, we obtain the 
likelihood function as. 



\ogpiY\e) = J2 log (/ °° <^^P (-^^^+ 

(3\y'i^dCd{x)\^+2mCpy^.pyudCd{x)})dx 



TT^ det Rv 



where Ry is the covariance matrix of y^, which is, 

Ry =cr^i + £^c(a;)c(x)". (18) 

It is readily to find that one eigenvalue of Ry equals to 
£d + o'c^ and all other eigenvalues equal to cr^. Thereby the 
determinant of Ry is. 



(25) 

where y^.p and y^ ^ are the received signals corresponding 
to the training symbols and data symbols, respectively, and 
/3 = £d/(j1{£d + CTc) is a constant. 

Now we show that c^ y^.p in (|25] l can be regarded as the 
minimum mean square error (MMSE) estimate for the channel 
coefficient hi except for a constant factor Since both hi and 
the receiver thermal noise are complex Gaussian distributed, 
the MMSE estimate of hi is equivalent to its lineai" MMSE 
estimate, which is. 



hi — (Rj, Vyh) yi,p, 



(26) 



det Ry — {£d + cr^)cr. 



2W2(L-1) 



where r„^ = E[yi ph*], and R„ is the covariance matrix of 



(19) 



According to the received signal model, we have 

Tyh = E[{^/£^h^Cp + nci,p)h*] = \/SdCp, (27) 

and 



Rv„ = SdCvC^ + CT^I- 



^Vp 



d^p^p 



(28) 



With the Matrix Inverse Lemma, R ^ is expressed as 



'Vp 



^yp 



I 






(29) 



Upon substituting (|28Jl and (|29l) to (|26Jl, the MMSE channel 
estimate becomes. 



(30) 



TdL 



Lal + LpEa''^^'^'- 



Let K = J .y % „ , then we have cj^y, „ — hi/ n. Substi- 
tuting it to ( l25l l. we obtain. 



logp(Y|6') = ^ log f / exp 

i=l \J-oo 






+ 



2/3. 



P\yldU^)? + ^3?{/i.yH,c,(a;)} ) dx 



(31) 



In the following, we will show that the MLE in this case is 
equivalent to a two-stage estimator During the first stage, the 
FC uses (l30l l to obtain the MMSE estimate of hi. It can be 
modeled as hi = hi + tin, where th. is the estimation error 
subjecting to complex Gaussian distributed with zero mean 
and variance L(T^/{La'^ + Lp£d)- During the second stage, 
the FC conducts the MLE using hi. 

Substituting hi to (|7]i, the received signal of the data 
symbols becomes, 

~ (32) 



YiA = \/£dhtCd{x) - \/£dth,Cd{x) + ncj,d 

where rici.d is the receiver thermal noise. 

Deriving the conditional PDF p{yi^d\hi,x) with (|32] |. we 
can obtain a likelihood function which is exactly the same 
as (ISTl i. This implies that the MLE with unknown CSI can 
exploit the available training symbols implicitly to provide 
the optimal channel estimation and then use it to provide the 
optimal estimation of 9. 

Note that the likelihood function in dSTT i is different from 
the likelihood function that uses the estimated CSI as the true 
value of the channel coefficients, which is. 



logp(Y|/i, = /i„0) = > Jog 
l^/Td 



N 



cxp 



{x-ef 



2al 



■+ 



'^{hyfdCdix)} Ax 



(33) 



This is a coherent estimator. By contrast, there exist both 
a coherent term ^{hiyf^Cd{x)} and a non-coherent term 
\yf^Cd[x)\'^ in dsTI ). This means that the MLE shown in ( |3T] | 
uses the channel estimate as "partial" CSI after accounting for 
the channel estimation errors. The true value of the channel 



coefficients contained in the channel estimation corresponds 
to the coherent term of the log-likelihood function, whereas 
the uncertainty in the channel estimation, i.e., the estimation 
errors, leads to the non-coherent term. We will compare the 
performance of the two estimators through simulations in 
Section IVn] 

IV. SuBOPTiMAL Estimator 

In the previous section, we developed the optimal estimators 
for the considered decentralized estimation systems, which 
are not feasible for practical systems due to their prohibitive 
computational complexity. Nevertheless, their performance can 
serve as the practical upper-bound when both observation 
noise and communication error are present, and their structure 
provides us some hint to derive low complexity suboptimal 
estimators. In this section, we take the suboptimal estimator 
with known CSI as an example. The estimator with unknown 
CSI can be obtained following the same principle. 

We first consider an approximation of the PMF p{S„i\d)- 
Following the Lagrange Mean Value Theorem, there exists ^ 
in an interval f^^^ — - — , -^ — - — 1 that satisfies. 



p{s„r\e) 



no 



A 



27rCT, 



■ exp 



2 



(34) 



If the quantization interval A is small enough, we can let ^ 
equal to the middle value of the interval, i.e., ^ = {S,n — S)/as, 
and obtain an approximate expression of the PMF as, 

A / {s,n-o': 



p{Sm\e) ^ PA{S^m\0) 



27rCT< 



■ exp 



2a2 



(35) 



Substituting (l35T l to (fT2b and computing its partial derivative 
with respect to 9, the likelihood equation is simplified as. 



1 ^ 

-y 



M-l 



Piyi\hi, C.m)pA{Sr,i\9)Sj 



M-l 



\ 



J^p^y^ 



i)PA{Sm\9) 



\ 



J 



(36) 



which is the necessary condition for the MLE of 9. 

Unfortunately, we cannot obtain an explicit estimator for 9 
from this equation because the right hand side of the likelihood 
equation also contains 9. However, considering the property of 
the conditional PDF, 

N (M-\ 

Yi,hi,9)S„ 



(37) 




i=l 



If we assume that 9 in the right hand side of the likelihood 
equation is known, the right hand side of dJTJ l is actually the 
MMSE estimator of £',„., i.e., Sm- ~ E \Sm- \yi,hi, 9]. This in- 
dicates that the MLE can be regarded as a two stage estimator. 
During the first stage, it estimates Sm^, i — 1,- ■ ■ ,N, with the 
received signals from each sensor. During the second stage, it 
combines 5^; by a sample mean estimator These two stages 
are consistent with the two steps of the EM algorithm ll27l . 



The first stage is the expectation step (E-step) and the second 
stage is the maximization step (M-step) of the algorithm. The 
set of the quantized observations Smt , which is the sufficient 
statistics for estimating 9, is the complete data of the EM 
algorithm. 

We present a suboptimal estimator with a similar two-stage 
structure. This estimator can be viewed as a modified EM 
algorithm. Because the likelihood function shown in ( fTSl i has 
multiple extrema and the equation shown in ( [36] l is only a 
necessary condition, the initial value of the iterative computa- 
tion is critical to the convergence of the iterative algorithm. To 
obtain a good initial value, the suboptimal estimator estimates 
Srm by assuming it to be uniformly distributed. Furthermore, 
since the estimate quality of the first stage is available, we 
use BLUE to obtain 9 for exploiting the quality information, 
instead of using the MLE in the M-step as in the standard EM 
algorithm. 

During the first stage of the iterative computation, the 
suboptimal algorithm estimates S„n under MMSE criterion. 
This estimator requires a priori probability of 5m. which 
depends on the unknown parameter 9. The initial distribution 
of Smi is set to be uniform distribution. After obtained a 
temporary estimate of 9, we can apply it to update the a priori 
probability of Srm ^i^^ estimate S^m iteratively. The MMSE 
estimator during the first stage is as follows. 



(|38T l is unbiased because. 






M-1 



y^ p{yi\hi, Sm,)p{SrnJdyi 
m=0 
» J\/-l 

/ y^ P{yi I hi , S,n, )p{Srm ) Srrn dXi 



m=0 



M-1 



= ^ p{S.m,)Sr, 



m=0 



'-nSra,\h,]. 



(39) 



However, p{S„n) in our algorithm is inaccurate since we 
use 9 instead of 6. The MMSE estimate may be biased, but it 
is hard to obtain this bias in practical systems. We regard the 
MMSE estimator as an unbiased estimate in our suboptimal 
algorithm. 

Given hi and y^, the variance of the MMSE estimate can 
be derived as. 



Y&T[Sm,\h,,y,]^E[Sl\h,,y,]-E^[Srm\h,,yi 



^M 



Em,=0 S?aMy^\h^' Sm,)p{Sm,) 



,M 



Y.m,=oP'yy^\hi^ ^rni)p{Sn 



d2 

(40) 



b„ 



E[S'™J/ij,yj 



M 



y^ p{Smi\hi,yi)Smi 

M 
y^ P{yi\hi, SmJpiSmi)S„ 

M 

^ p{yt\hi, Sjni)p{Smi) 
mi=0 



(38) 



where p{yi\hi, S,n,) equals p{yt\hi, c.^) in ([T3]l, and p{S,n,) 
is the estimate for a priori PDF of S„ii ■ Obtained 9, we use 

PA{Srni\S) to update p{Smi), i.e., let p(S',„J = PA{Smi\d). 

In this section, we omit 9 in p{Smi) for notational simplicity, 
though it depends on 9. 

Now we derive the mean and variance of Sm-, which will 
be used in the BLUE of 9. 

If p{Smi) equals to its true value, the MMSE estimator in 



Then the BLUE for estimating 9 is. 



N 

E 



-{Gl+Ym:[Sm,\hj,yj] 



Orr 



—^ 0-2 +Ya.r[SmAhi,yi] 



(41) 



The iterative algorithm can be summarized as follows: 

51) Let p{Sm,} = ^/AI as the initial value. 

52) Compute S„i^, i = 1, • • • ,N, and its variance with ( |38] ) 
and (gOll. 

53) Substitute Srm ^i^^ i'^s variance to (l4Tl i to get 9. 

54) Update p{Sm,) using PA{Srm\(^)- 

55) Repeat step S2) ^ S4) until the algorithm converges or 
a predetermined number of iterations is reached. 

Note that this suboptimal algorithm differs from the one 
proposed in [7], which applies the maximal a posteriori 
(MAP) criterion to detect binary observations of the sensors, 
then uses the results as the true value of the observations in the 
MLE derived in noise-free channels. Our suboptimal algorithm 
inherits the structure of the MLE developed in fading channels, 
which gives "soft" estimates of the quantized observations 
at first, and combines them with a linear optimal estimator 
afterward. By conducting these two stages iteratively, the es- 
timation accuracy improves rapidly. Although the suboptimal 



algorithm may converge to some local optimal solutions due 
to the non-convexity of the original optimization problem, it 
still performs fairly well as will be shown in the simulation 
results. The convergence of the algorithm will be studied by 
simulations in Section FVlIl 



and its only maximum can be obtained by solving the equation 
dlogp{Y\h,e)/dc{e) = 0, which is. 



V. Special Cases of the MLEs 

To gain some insights on the decentralized MLE, in this 
section, we first study two special cases of the MLEs when 
either the observations of the sensors or the communications 
are perfect. After that, we discuss the form of the MLE with 
known CSI using two extreme case quantization, which are the 
AF transmission (infinitesimal quantization resolution) and the 
1-bit quantization (rough most resolution). This will provide 
the connections of the derived MLE with existing well-studied 
optimal estimators in these special cases. 

A. Ideal Observations and Ideal Communications 

Considering the approximate expression of the PMF shown 
in ( |35] |. the likelihood equation with known CSI is approxi- 
mated as. 



logp(Y|h, 



N / /M-1 

= H log H cxp 

i=l \ \m=0 



|y, - VSdhiCr, 



(42) 



This tells us that the MLE exploits both the signal level 
information y^ and the data level information Sm when the 
quantization interval is small enough. If the observations are 
perfect, say, no observation noises, we will show that the 
MLE degenerates into a signal level optimal combiner — MRC 
followed by data demodulation and parameter reconstruction. 
On the other hand, if the communications are perfect, we will 
show that the MLE reduces to a data level optimal fusion 
estimator — BLUE. 

When CSI is unknown, we draw similar conclusions, except 
that the MLE becomes a subspace-based estimator followed 
by data demodulation and parameter reconstruction when there 
are no observation noises. 

1) Noiseless Observations: When the observations of the 
sensors are ideal, i.e., Xi — d,yi ~ 1,- ■ ■ ,N, we have. 



p{x\0) = s{x-e), 



(43) 



where 6{x) is the Dirac-delta function. 

We first consider the MLE with known CSI. Substituting 
(|43l l to (IHJ and ignoring all terms that do not affect the 
estimation, the log-likelihood function is simplified as. 



logp(Y|h,( 



N 

4 = 1 



^h,c{e)\\ 



(44) 



Since c{d) is a piecewise constant function that is not 
derivable, we cannot compute the partial derivative of (l44l i 
with respect to 9. Instead, we first regard c{9) as the parameter 
under estimated and obtain the MLE for estimating c{9). The 
log-likelihood function in ( |44] | is concave with respect to c{6) 



m^ 



1 



N 



'^E,=i 



E^^ 



y*- 



(45) 



i=l 



Then we can use it as a decision variable to detect the trans- 
mitted symbols and reconstruct 6 according to the quantization 
rule with the detection results. 

It shows that when the observations are perfect, the structure 
of the MLE is the MRC concatenated with data demodulation 
and parameter reconstruction. This is no surprise since in this 
case the signals transmitted by different sensors are all identi- 
cal, the receiver at the EC is able to use the traditional diversity 
technology to reduce the communication errors. Meanwhile, 
it is unnecessary to use the redundant observations for data 
fusion. 

We then consider the MLE with unknown CSI. Upon 
substituting ( |43] | to ( fTSI l and ignoring all terms that do not 
affect the estimation, the log-likelihood function becomes, 

N 

logp{Y\0) = 5]|y?c(0)p 

i=l 

= c(0)"fEy.y?jc(0). (46) 

Again, we regard c(^) as the parameter to be estimated. 
Recall that the energy of c{9) is normalized. Then the problem 
that finds c{d) to maximize ( |46T l is a solvable quadratically 
constrained quadratic program (QCQP) I.28J , 



N 



max 

c{e) 

s.t. 



cwMEy^yn-w 



\j=i 



1. 



(47) 



The solution of (l47T i can be obtained following the results 
about QCQP in 



(48) 



C{d) ^V,nax ( E^'^n 



where VmaxC^) is the eigenvector corresponding to the 
maximal eigenvalue of the matrix M. 

This shows that when CSI is unknown at the EC in the 
case of noise-free observations, the MLE becomes a subspace- 
based estimator. 

2) Noiseless Communications: When a^ — > 0, we have 
Yi — V^dhiCnii- It means that y.; is merely decided by Cm^, or 
equivalently decided by Smt ■ Then the log-likelihood function 
becomes a function of the quantized observation Smt ■ 

We first consider the MLE with known CSI. The log- 
likelihood function with ideal communications is. 



logp(Y|h, 61) -^logp(S|h, 6*) 
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(49) 



1=1 \ \ 
where S — \S„ 



J *3mjvJ 



Computing the derivative of (|49T l. we have the likelihood 
equation. 



N exp ( 



(S„ 



-) - cxp i^- 



(S„ 



q{- 



(50) 

Generally, this likelihood equation has no closed-form so- 
lution. Nonetheless, the closed-form solution can be obtained 
when the quantization noise is very small, i.e., A ^- 0. Under 
this condition, Sm- — > Xi and ( fSOl l becomes. 



lim 



giogp(S|h,( 

de 



N 

E' 



0. 



4=1 



The MLE obtained from JSTT ) is. 



1 ^ 



(51) 



(52) 



It is also no surprise to see that the MLE reduces to BLUE, 
which is often applied in centralized estimation fTT|, where 
the PC can obtain all raw observations of the sensors. 

When the CSI is unknown at the PC, the receiver of the PC 
can recover the quantized observations of the sensors with 
error-free if the proper codebook, which will be discussed 
in Section IVI-AI is applied. Then the MLE with unknown 
CSI also degenerates into the BLUE shown in ( |52] |. This is 
reasonable since only the structure of communication depends 
on the channel information. 

The special cases of the MLEs with noiseless observations 
or noiseless communications are summarized in Table |T] 

TABLE I 
The special cases of the MLEs with known and unknown CSL 



MLE 


Noiseless Obser. 


Noiseless Comm. & 
A-s>0 


Known CSI 


MRC 


BLUE 


Unknown CSI 


Subspace-based esti- 
mator 


BLUE (proper code- 
book) 



B. AF Transmission and Binary Quantization 

1) AF Transmission: Although the estimators we derived 
until now consider digital communications, they can also 
be applied when using the AF transmission, because the 
messaging function we introduced in Section |II] can describe 
the AP transmission as well. 

The messaging function for AP transmission is. 



c{x) — ax, 



(53) 



where a is the amplification gain. Since c(a;) reduces to a 
scalar, we rewritten it as c{x). 

Por AP transmission, we rewrite the energy normalization 
condition as E[c{x)^c{x)] = 1. When a = l/y^E\W[+a^, 
this condition is satisfied. Because 6 is an unknown non- 
random parameter, we cannot obtain E[6'^]. To solve this 



problem, we assume that 6* is a random variable uniformly 
distributed in [~V,+V], then we have E[0^] = V^/3.^ 
The received signal at the PC is. 



(54) 



yi = VEdhic{xi) + nc,i 

= VEdhia(0 + nsA) +nc,. 



Substituting ( |53] l and ( |54] i to the log-likelihood function 
with known CSI shown in (fTTT i. we can obtain the MLE with 
AP transmission. The derivation to obtain the MLE in this way 
is rather involved due to the cross-correlation between the real 
and imaginary parts of the received signals. In the following, 
we wiU give an alternative derivation that is simple. 

We first find a vector of sufficient statistic, then derive the 
log-likelihood function using this vector as an observation 
vector. When /ii, • • • , /ijv is known at the receiver, it is not 
hard to show that Y^ — [/i*j/i,--- , /i^yjv] is a sufficient 
statistic of Y = [yi , • • • , Hn] for estimating 6, where. 



i^l,--- ,N. (55) 



KVi = V^d\hi\ a{9 + nas) + h*nc 



The real and imaginary parts of h*yi are statistically inde- 
pendent and Gaussian distributed. We can find that the mean 



and variance of the real part is \/Ed\hi\ a9 and Ed\hi\ a ag + 
|/i,;pcr^/2, and the mean and variance of the imaginary part is 
zero and |/iipcr^/2. Ignoring the constant not associated with 
the MLE, we can obtain the log-likelihood function as, 

N 



io..fYifl^- s^( mKy^}-V£d\hr?or . 



(56) 



and the likelihood equation as. 



d\ogp{Yh\e) 
de 
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0, (57) 



where 5ft{z} and Qjz} are the real and imaginary part of z, 
respectively. 

The log-likehhood function has only one maximum that can 
be obtained by solving the likelihood equation, 
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(58) 



nl + 2Eda''M-'al 



which can degenerate to the optimal estimator proposed in 
[24| under the assumptions therein. 

The asymptotic performance of the AP transmission in 
fading orthogonal MACs are analyzed in |24| and |25l . 

'By this way, the message function c(z) can also be used to present other 
modulations such as quadrature amplitude modulation (QAM). 



2) Binary Quantization: Considering the stringent con- 
straint on the bandwidth of WSNs, many contributions assume 
that the sensors use binary quantizer as the local processor. Our 
estimators can apply when binary quantizer is used. Based 
on the system models in Section [III the messaging function 
with binary quantizer and binary phase-shift keying (BPSK) 
modulation is 



Ci(a;) 



-1, X < r 
-l,x > T 



(59) 



where r is the quantization threshold, which equals to for 
the uniform quantizer we considered. 

Substituting Ci(a;) to the likelihood function shown in ( fTTT l. 
we have. 



N 



p{Y\h,i 



Y[\og\ exp 

1=1 



/£dhi 



exp 



\yi - V^hi 
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a-F^^T-e)) 



(60) 



where i^„ , (x) is the cumulative distribution function (CDF) 
of the observation noise. 
Define 



a{yi 



and 



- exp 



exp 



(61) 



%.)=exp(- 'y' ^^'1' )(l-j^„jr-g)), (62) 



then ( |60] l can be simplified as 

N 

p{Y\h,e) - l[[a{y,)FnAr -6) + b{y,)]. (63) 

i=l 

This is the same as the likelihood function shown in (9) of 
ff\ except for the presence of the channel coefficients since 
we consider fading channels. 

VI. Discussions 
A. Transmission Codebook Issues 

When digital communications are used, the transmission 
codebooks can represent various quantization, coding and 
modulation schemes. Here we discuss the impact of the 
codebooks on the decentralized MLEs. 

We rewrite the conditional PDF with known CSI shown in 
(Unil as. 



p{y^\ht,x) = 



1 



1 



2\L 



(vro-; 



exp 



2\L 



exp 



exp 



/Tdhtc{x) 



£d\K\^ 



|y,||2 2Vrd^{h,yfcix)} 



<64) 



Comparing the conditional PDF with unknown CSI p{yi\x) 
shown in (ISTT i with p{yi\hi, x) shown in ( |64] |. we see that both 



PDFs depend on the correlation between the received signals 
Yi and the transmitted symbols c{x). With known CSI, the 
optimal estimator is a coherent algorithm, since (|64] i relies on 
the real part of the correlation, yj^c(x). With unknown CSI, 
the optimal estimator is a non-coherent algorithm, since (ISTT i 
depends on the square norm of yf c(a;). Because yf c(a;) ~ 
y/£dh*c^{xi)c{x)+n^^c{x), both MLEs depend on the cross- 
correlation of the transmission symbols, c^{xi)c{x). 

Taking digital communications as an example, if there exist 
two transmission symbols c,„ and c„ in the transmission 
codebook which have the same norm. 



CriG' 



34> 



(65) 



then p{yi\x) will have two identical extrema since the MLE 
with unknown CSI only depends on |yf c(a;)p. Such a phase 
ambiguity will lead to severe performance degradation to the 
decentralized estimator. Therefore, the auto-correlation matrix 
of the codebook plays a critical role on the performance of 
the MLE, especially when CSI is unknown. 

Many transmission schemes have this phase ambiguity 
problem. For example, when the natural binary code and 
BPSK modulation are applied to represent each quantized 
observation and to transmit, for any c„i in such a transmission 
codebook, defined as Cf„, there exists c,n' in Cm that satisfies 
Cm' ~ —Cm. Therefore, Cm is not a proper codebook. 

In order to cope with the phase ambiguity problem inherent 
in the codebook Cm, we can simply insert training symbols 
into the transmission symbols. Though heuristic, this approach 
can provide fairly good performance because the MLE can ex- 
ploit the training symbols to estimate the channel coefficients 
implicitly as we have shown. 

Since the MLEs are associated with the auto-correlation 
matrix of the transmission codebook, this allows us to enhance 
the performance of the estimators by systematically designing 
the codebook. Nonetheless, this is out of the scape of this pa- 
per Some preliminary results for optimizing the transmission 
codebooks are shown in 



B. Asymptotic Performance of the MLEs with respect to N 

We first consider the Cramer-Rao lower bound (CRLB) 
when CSI is unknown at the EC, which is. 
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where p'{Srn\S) and p"{Sm\d) is the first and second order 
partial derivatives with respect to 9, respectively. 

It shows that the CRLB of the MLE with unknown CSI 
decreases with the factor of 1/N, which is the same as 
the BLUE lower bound of the centralized estimation flTI. 
This is due to the fact that given 9, the received signals y^ 
from different sensors are statistically identical distributed and 
independent among each other Therefore, all these signals 
contribute equally for reducing the estimation errors. 

When CSI is available at the FC, given hi, the received 
signals are no longer identical distributed. In this case, the 
CRLB will depend on the channel realization, which is very 
hard to derive. However, since more information can be 
exploited for estimation, we can infer that the CRLB with 
known CSI is always lower than that with unknown CSI. In 
other words, the asymptotic performance of the MLE with 
known CSI will be no worse than that of the MLE with 
unknown CSI. 

C. Computational Complexity 

1) MLE: We take the MLE with known CSI as an example 
to analyze the computational complexity. The analysis for the 
MLE with unknown CSI is similar. 

The MLE can be found by performing exhaustive searching. 
In order to make the MSE introduced by the discrete searching 
neglectable, we let the searching step-size be less than A/N, 
Then we need to compute the value of the likelihood function 
at least AI x N times to obtain the MLE. 

The FC applies ( fT2] i. (fTsT l, and ( fT4] i to compute the values of 
the likelihood function with different 9. The exponential term 
in ( fTsl l is independent of 9, thus it can be computed before 
searching and be stored for future use. 

Given 9, we still need to compute p{Sm\9), m = 
0, • • • , A/ — 1, which complexity is 0{M), then to conduct 
M additions and M multiplications to obtain each value of 
the likelihood function. Thus the computational complexity 
for getting one value of log|5(Y|h, 9) is 0{MN). 

After considering the operations required by the exhaustive 
searching, the overall complexity of the MLE is 0{A'PN^). 

2) Suboptimal Estimator: The estimator presented in Sec- 
tion |lV]uses an iterative algorithm. For each iteration, we need 
to get Smt and its variance with ( |38] | and (|40| |. then obtain the 
estimate of 9 with W\\ . The complexity is similar to that of 
computing the log-likelihood function, which is 0{MN). 

If the algorithm converges after It times iteration, the 
complexity of the suboptimal estimator will be 0{ItMN). 

VII. Simulations 

We use the MSE of estimating 9 as the metric to evaluate 
the performance of the estimators. The observation SNR 
considered in simulations is defined as ifTSl . 



different transmission schemes. The communication SNR is 
then, 

7c-101ogio(^^). (68) 

The codebooks used in the simulations are summarized in 
Table HI] Consider the general features of WSNs that short 
data packets are usually transmitted and each sensor is of low 
cost. We use a simple error-control coding scheme, cyclic 
redundancy check (CRC) codes with generator polynomial 
G{x) = x'^ + X + 1, as an example of the coded transmission. 
Its codebook is denoted as Ctc- For comparison, uncoded 
transmission is also evaluated, which codebook is denoted 
as Cf„. We consider BPSK modulation to generate all code- 
books. Because the code length of the uncoded transmission 
is shorter than that of the coded transmission, the energy to 
transmit each symbol will be higher for the given £d- Due to 
the phase ambiguity problem discussed in Section IVI-AI we 
use the codebook with training symbols (TS) C^p whenever 
we evaluate the estimators with unknown CSI unless specified. 

TABLE II 

The Summary of the Codebooks Considered. 



Codebook 


ECC 


TS 


Modulation 




No 


No 


BPSK 


Ctc 


CRC 


No 


BPSK 


Ctp 


No 


Yes 


BPSK 



, w 

7. -201oglo( — 



(67) 



We use £rf, the energy consumed by each sensor to transmit 
one observation, to define the communication SNR in order 
to fairly compare the energy efficiency of the estimators with 



Two estimators with ideal communications are shown as 
the baseline, which MSEs can be served as the performance 
lower bound. They are BLUE which MSE is crI/N, and Quasi- 
BLUE when considering quantization noise |fT7l. The Quasi- 
BLUE bound is a more practical lower bound for comparison 
since we consider quantization for all estimators in this paper 
only except for the estimator with AF transmission. 

A. Influence of the Quantization Bit-Rate 

We first examine the impact of quantization bit-rate of 
the sensors. Three WSNs are considered, where the sensors 
in different WSNs use different quantization bit-rates. The 
three quantization bit-rates are set Xo ht K = 1, 2, and 
4, respectively. The sensors apply C(„ as the transmission 
codebook, where the length of the transmitted symbols is 
L = K. In this simulation, we let both the total energy 
and the total bandwidth consumed by the networks to be 
identical when using different quantization levels. Due to 
the total network bandwidth constraint, the numbers of the 
active sensors when K — 1, 2, and 4 are 40, 20, and 10, 
respectively. Due to the total network energy constraint, the 
energy consumed by each sensor to transmit one observation 
is also different. For example, if the transmitted energy of a 
sensor when A^ = 40 is £d, the transmitted energy is 2£d and 
A£d when A^ = 20 and 10, respectively. 

We compare the MSEs of the MLE with known CSI and 
the Quasi-BLUE lower bound for different quantization bit- 
rates in Fig. |2l It is shown that low quantization bit-rate is 
only applicable for the cases with extremely low observation 
SNR. At medium and high observation SNR levels, the optimal 
estimator with 1 or 2-bit quantization is inferior to that with 
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Q-BLUE, 4bil 
MLECSi,4bit 
Q^BLUE, 2bit 
MLECSi, 2bit 
Q-BLUE, Ibil 
MLECSI, 1bil 




Observation SNR 



Fig. 2. The MSEs of the MLE with different K. The MSE of MLE with 
known CSI is marked as MLE CSI in the legend, and Quasi-BLUE lower 
bound is marked as Q-BLUE. The communication SNR is 6 dB for 4 bit 
quantization, 3 dB for 2bit quantization, and dB for binary quantization. 



4-bit quantization under the same constraints of total energy 
and bandwidth. 

This indicates that we should design the quantization bit 
rate according to the observation SNR to reduce the resource 
consumption of the network. For high observation SNR, we 
should employ high bit-rate with reduced number of the active 
sensors. We note that similar conclusion is drawn in [ Tsl 
except that ifTSI considers error-free communications whereas 
we consider the communications over fading channels. 

B. Convergence of the Suboptimal Estimators 

We then study the convergence of the suboptimal estimators. 
Figure [3] depicts the MSEs of the suboptimal estimators as a 
function of the number of iterations. It is shown that the MSEs 
of the suboptimal estimators will converge after two iterations 
at different communication SNRs no matter if CSI is known. 



C. MSE versus the Communication SNR 

Figure |4] depicts the MSEs of the estimators with known 
CSI. Except for the estimator using AF transmission, all other 
estimators use digital communications with a 4-bit uniform 
quantizer (Af = 16). 

To demonstrate the performance gain of the proposed es- 
timators that jointly optimize demodulation and parameter 
estimation, two traditional fusion based estimators and a MRC 
based estimators are simulated. In the fusion based estimators, 
the FC first demodulates the transmitted data from each 
sensor, then reconstructs the observation of each sensor from 
the demodulated symbols following the rule of quantization, 
afterward combines these estimated observations with BLUE 
fusion rule to produce the final estimate of 6. When ECCs are 
applied at the sensors, the receiver at the FC will exploit its 
error detection ability to discard the data that cannot pass the 
error check. In the MRC based estimator, the FC first combines 
the received signals from all sensors, then demodulates the 
transmitted symbols. Finally the FC obtains the estimate of 9 
using the detected symbols according to the quantization rule. 

Except for the fusion based estimator with ECC uses code- 
book Ctc, all estimators use codebook Ct„ in this simulation. 
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Fig. 4. The MSEs of the estimators with known CSI as a function of 
communication SNR when A'^ = 10, 7s = 20 dB. In the legend, "Fusion- 
CRC and "Fiision-NoECC stand for two fusion based estimators using the 
codebook Ctc and Ct„, "MRC stand for the MRC based estimators, "Analog 
AF" stands for the MLE with AF transmission, "MLE" and "Subopt" denotes 
the MLE and suboptimal estimators, respectively, and "Q-BLUE Bound" and 
"BLUE Bound" stands for two lower bounds with ideal communications. 




Number of Iterations 

Fig. 3. The convergence of the suboptimal estimators when 73 = 20 
dB, N = 10. In the legend, NoCSI indicates suboptimal estimator with 
unknown CSI, and CSI stands for the suboptimal estimator with known CSI. 
The communication SNRs are 3 dB, 6 dB and 9 dB, which are marked in 
the legend. 



It is shown that the MLE and suboptimal estimators out- 
perform both the MRC based and the fusion based estimators. 
The MSEs of the MLE and suboptimal estimator approach the 
Quasi-BLUE lower bound rapidly along with the increasing 
of the communication SNR, whereas the suboptimal estimator 
degrades a little at low SNR. The MSE of MLE using AF 
transmission is larger than that using digital transmission, since 
AF transmission is no longer optimal in fading channels. 

According to the performance analysis for BPSK mod- 
ulation in Rayleigh fading channels Il30l . the BER of the 
transmission scheme with codebook Cm exceeds 0.15 when 
the communication SNR is lower than 3 dB. ECC can improve 
the transmission performance for high communication SNR, 
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but it causes more errors for low SNR. For the transmission 
schemes using CRC, the BER is even worse because long 
codes will reduce the transmission energy per symbol. For 
such a high BER, the fusion based estimators, especially those 
with ECCs, do not perform well. Most of the demodulated 
data will be dropped due to the error check, thus the fusion 
estimators do not have enough information to exploit, which 
finally leads to worse MSE performance. 

The performance of the estimator based on the MRC is 
much worse than the proposed estimators, which shows the 
significant impact of the observation noise. 
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Fig. 5. The MSEs of the MLEs with different level of channel information, 
where N = 10 and 7s = 20 dB. In the legend, "MLE NoCSI NoTS", "MLE 
NoCSI TS2" and "MLE NoCSI TS5" stands for the MLE with unknown CSI 
and different number of training symbols, respectively. "MLE EstCH" denotes 
the MLE with known CSI that applies hi as true value of hi . "MLE CSI (ref.)" 
is the MLE with known CSI, just for a reference. 
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the codebook Ct„ without training symbols, we also evaluate 
the codebooks with 2 and 5 training symbols. It is shown 
that if Ctn is applied as the codebook for the MLE with 
unknown CSI, the MLE exhibits rather high MSE that cannot 
be improved by increasing the communication SNR. This 
validates our analysis in Section lVI-Al that the phase ambiguity 
of Ctn will lead to severe performance degradation of the 
estimator. When we insert training symbols, the performance 
of the MLE with unknown CSI improves significantly, but it is 
still much worse than that of the MLE with known CSI at low 
communication SNR levels. It is interesting to see that using 
more training symbols do not improve the performance of the 
MLE as expected. This is because the energy for transmitting 
an observation is fixed, inserting training symbols will reduce 
the energy for the data symbols. Our simulations show that the 
best performance is obtained when Lp — 2. This is consistent 
with the observation of |j3Tl , where the optimal Lp equals 

Vk. 

To further observe the impact of different levels of CSI on 
the optimal and suboptimal estimators. Fig. |6] shows the MSEs 
of the MLE and suboptimal estimators with known CSI and 
with unknown CSI but using two training symbols. Similar to 
the estimators with known CSI, the suboptimal estimator with 
training symbols is inferior to the MLE at low communication 
SNR. However, the performance of the suboptimal estimator 
degrades less than the MLE due to the channel estimation 
errors. 

D. MSE versus the Number of Sensors 
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Fig. 7. The MSEs of the estimators with known CSI, where 7c = 6 dB 
and 7i, = 20 dB. The meaning of the legends is the same as Fig|4] 



Fig. 6. The MSEs of the MLE and suboptimal estimators with training 
symbols and with known CSI, where A'^ = 10 and 7s = 20 dB. In the 
legend, MLE and Subopt means MLE and suboptimal estimators, respectively. 



In Fig. m the MSEs of the MLEs with unknown CSI are 
shown. Two MLEs, which differently use training symbols, are 
considered. One is the MLE with training symbols as shown 
in dSTl ), and the other is the estimator as in ( [33] ), which uses 
the estimated channel coefficients as their true values. Besides 



Figure |7] and Fig. [8] show the MSEs of the estimators with 
known CSI and unknown CSI as a function of the number of 
sensors, N . We can see that the MSEs of all the estimators 
decrease at the speed of 1/A^ for large enough N, but the 
MSEs cannot approach the lower bound due to communication 
errors. Compare the MSEs of the MLEs, we can see that 
the results validates our asymptotic performance analysis for 
MLEs both with known CSI and unknown CSI in jVI-Bj From 
Fig. |7] we can observe that the proposed estimators perform 



13 



- MLE NoCSI TS2 

- MLE NoCSI TS5 

- MLE EstCH TS2 

- MLE EstCH TS5 

- MLE CSI (ret.) 

- Q-BLUE Bound 

- BLUE Bound 




14 16 18 20 



Fig. 8. The MSEs of the estimators with training symbols and with estimated 
CSI when 7c = 6 dB and 7^ = 20 dB. The legends are the same as Fig|5] 



much better than the fusion based estimators and the MRC 
based estimators. It means that the networks with traditional 
approaches must activate more sensors to achieve the same 
MSE performance as those with our estimators, which will 
lead to low energy and bandwidth efficiency. 

E. Computational Complexity of the Estimators 

To evaluate the computational complexity, we record the 
time consumed by 10,000 Monte-Carlo simulations for the 
proposed estimators with known CSI. Table |lll] shows the 
computation time in seconds at different communication SNR 
levels. The step-size for the exhausting searching of the MLE 
is set to A/N. The number of the iterations for suboptimal 
estimator is set to 2 according to the convergence analysis. 

It is shown that the computation time of the suboptimal 
estimator is much less than that of the MLE, and is almost 
invariant with the communication SNR since the number of 
the iterations is fixed. The computation time consumed by the 
MLE varies slightly, which comes from the implementation of 
the truncate exponential function in simulation codes. 

TABLE 111 

The computation time in seconds consumed by simulating the 

estimators with known csi. 



7c 


3 dB 6 dB 9 dB 12 dB 


15 dB 


MLE 
Suboptimal 


274.12 270.86 258.16 234.56 
6.80 6.88 6.78 6.58 


207.58 
6.35 




VIII. Conclusion 





In this paper, we studied the decentralized estimation for 
a deterministic parameter using digital communications over 
orthogonal multiple-access fading channels with a uniform 
multiple-bit quantizer By introducing a general messaging 
function, the proposed estimators can be applied for digital 
communication systems using various quantization, coding 
and modulation schemes and for analog communication sys- 
tems such as those using the well-studied AF transmission. 



We derived the MLEs with known and unknown CSI. 
When inserting training symbols before the data symbols, the 
MLE with unknown CSI estimates channels implicitly and 
exploits the channel estimates in an optimal way. Following the 
structure of the MLE, we designed a suboptimal estimator that 
has affordable complexity and converges rapidly. It performs 
as well as the MLE at high communication SNR and has minor 
performance loss at low communication SNR. 

Simulation results show that both the MLEs and the sub- 
optimal estimators outperform the traditional MRC based 
and fusion based estimators, and the estimators using digital 
communications outperform those using AF transmission in 
Rayleigh fading channels. Compared with the WSN that 
using binary quantization for decentralized estimation, the 
system using multiple-bit quantization has superior energy and 
bandwidth efficiency. Therefore, even with the strict bandwidth 
constraints, we suggest that the WSNs should use multiple- 
bit quantization rather than binary quantization when the 
observation SNR is relative high. 
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